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We perform a timing analysis of our current implementation of the algorithm described in [T] . Our 
implementation performs the classical processing associated with an n x n lattice of qubits realizing 
a square surface code storing a single logical qubit of information in a fault-tolerant manner. We 
empirically demonstrate that our implementation requires only 0{n^) average time per round of 
error correction for code distances ranging from 4 to 512 and a range of depolarizing error rates. 
We also describe tests we have performed to verify that it always obtains a true minimum weight 
perfect matching. 



We assume familiarity with the surface code [H [3], 
fault-tolerant schemes built on the surface code [3]-[S] and 
the classical processing algorithm described in [1]. Our 
goal is not to repeat existing discussion, rather to pro- 
vide additional data on the performance and correctness 
of our implementation of this algorithm. This implemen- 
tation is called scode (ess-code). 

Scode consists of four layers of software — simula- 
tor, problem preparer, problem solver, data gatherer. 
The simulator executes no initialization surface code er- 
ror detection with depolarizing noise (see [7]). Random 
Pauli errors are generated and propagated using a Pauli 
frame. When errors lead to syndrome measurement value 
changes, graph vertices are generated at these space-time 
locations by the problem preparer. By pre-analyzing all 
possible single error processes [7] , an underlying lattice of 
dots and lines is also prepared with dots at every location 
a vertex could potentially be generated and lines between 
every pair of locations that could have vertices generated 
by a single error. The first order probability pn^c of each 
line is calculated and a weight w = — In(piinc) stored in 
each line. The rationale for doing this can be found in 
[7] . A lattice of dots and lines with associated probabilis- 
tically generated vertices (from surface code simulation) 
is shown in Fig. [T] 

In many ways, a lattice plus vertices can be considered 
an implicit complete graph with an edge between any pair 
of vertices having weight equal to the minimum weight 
path between those vertices. The task is to match all 
vertices in pairs or to neighboring boundaries such that 
the total weight of all match paths is minimal. The ba- 
sic algorithm that efficiently solves this problem given a 
standard graph is the minimum weight perfect matching 
algorithm [8l |9] . We have extended this algorithm to in- 
clude the concept of boundaries and permit new vertices 
to be dynamically added to the graph. 

We have two operational versions of extended min- 
imum weight perfect matching — complete match [7 
which firstly constructs explicit edges between all pairs 
of vertices no more than approximately d rounds of error 
correction apart, and edges on demand match [I] which 
only constructs a small number of local edges and adds 
further edges to the problem as required. The graphs 
and matchings generated by complete match (cmatch) 
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FIG. 1: Distance 4 example of a lattice of dots and lines with 
associated vertices. Dots (small silver balls) correspond to 
space-time locations where the endpoints of error chains could 
potentially be detected. Vertices (large red balls) correspond 
to space-time locations where error chain end points have been 
detected. Silver lines link pairs of dots where a pair of vertices 
could be generated by a single error. Green lines link spatial 
boundaries to a single dot where a single vertex could be 
generated by a single error. 



and edges on demand match (eodmatch) given Fig. [T] as 
input are shown in Fig.[2]and Fig.|3]respectively. The to- 
tal weight of matched edges in both cases is identical and 
in this case the matchings themselves are identical. We 
have tested cmatch and eodmatch on millions of varied 
problems, large and small, and always observed identical 
total weights, strongly implying both implementations 
are correct. 

Cmatch obtains the true minimum weight perfect 
matching despite only including edges between vertices 
separated by a finite number of rounds. Vertices sep- 
arated by a very large number of rounds are always 
cheaper to match to their nearest boundaries than to 
one another. By using the weights of the lines in the lat- 
tice, we calculate the minimum span of rounds to connect 
with edges to guarantee a minimum weight matching. 
Eodmatch also obtains a true minimum weight perfect 
matching as any required edge will eventually be included 
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FIG. 2: Output of cmatch when given Fig. [T] as input. The 
underlying lattice is used to construct a complete graph and 
then discarded. Edges in the matching are shown in red. 




FIG. 3: Output of eodmatch when given Fig.[T]as input. Note 
that only one edge other than those ultimately included in the 
matching has been created (just visible between the top right 
two vertices). 



during execution. 

Further evidence of the correctness of eodmatch conies 
from studying the probability of logical error per round 
of error correction (pl) at depolarizing probabilities p 
well below threshold. We calculate by simulating 
t_check rounds of faulty quantum computer operation, 
then turning off errors, capping the matching problem 
with a perfect round of error correction, applying cor- 
rections, checking whether we have an odd or even num- 
ber of errors along one of the boundaries and recording 
whether this is different to the previous time we checked. 
The perfect round of error correction is then undone and 
another t_check faulty rounds simulated and the process 
repeated. 

It may seem that the ideal value of t .check is 1 to en- 
sure that no logical errors are missed, however this is not 
the case. We have observed that many combinations of 
errors lead to the observation of a logical error if a per- 
fect round of error correction is inserted halfway through 
it, but no logical error if the perfect round of correc- 
tion is sufficiently distant. With frequent checking this 
can mean a benign pattern of errors is counted as several 
logical errors. Instead, we typically use a value of t_check 



such that a change in the parity of the number of errors 
observed along a boundary occurs approximately 10% of 
the time. We have empirically found that this leads to 
a logical error rate estimate robust to wide variations of 
t_check about this value. The probability of a change 
per check is equal to the probability of an odd number of 
logical errors in t_check rounds enabling to be easily 
calculated. 

A distance d code can reliably correct [(d— 1)/2J errors. 
At low error rates p clusters of errors are well separated. 
The probability of suffering a logical error inducing clus- 
ter of Ud = [{d+ 1)/2J errors should therefore be 0{p'^'') 
if the full distance of the code is being reahzed. Figs. [4]- 
[5] show the complete set of data we have collected for 
the square surface code. Polynomials Adp"'"' are drawn 
through the lowest data point we were able to obtain for 
distances 3, 5, 7 and 9. 

It is computationally expensive to obtain statistics at 
very low error rates and high distances as very few logi- 
cal state changes are observed. It is also computationally 
expensive to obtain data at high error rates and high dis- 
tances as the minimum weight perfect matching problem 
becomes more difficult around and above the threshold 
error rate (0.9% [1]). The raw data used to generate 
Figs. |4||5j including timing information, can be found in 
Appendix 

The distance 3 and 5 dashed asymptotic curves in 
Figs. |4] [5] agree very well with the data. For higher dis- 
tances, it is not currently possible to simulate a suffi- 
ciently large number of rounds of error correction to ob- 
tain sufficient information at low enough distances and 
achieve such tight agreement. Note that the high dis- 
tance data curves approach the asymptotic curves with 
a steeper gradient, implying the surface code is capable 
of regularly correcting temporal clusters of errors con- 
taining more errors than the maximum guaranteed to be 
correctable. This is a generic feature of topological quan- 
tum error correction, as a large cluster of errors widely 
scattered across the code is not dangerous provided the 
errors are sufficiently sparse. 

The timing information in Appendix [X] includes every- 
thing — initial bootup of the simulation, the simulation 
of the underlying quantum computer, problem genera- 
tion, matching, perfect rounds of error correction to en- 
able logical state change detection, and maintenance of 
an appropriate Pauli frame. Figs. |6]shows the amount of 
time devoted to each round of matching alone at three 
different error rates for distances c? = 4, 8, 16, . . ., 512. 
The quadratic scaling of required time with distance is 
well demonstrated. At small d nearby boundaries prevent 
the growth of large blossoms leading to increased perfor- 
mance. At very high d memory access effects lead to a 
slight slowdown. Note that real computer systems are 
too complex to provide perfectly smooth graphs of time 
scaling even with long time averaging as the interplay of 
different levels of cache and RAM leads to measurable 
deviations from the ideal scaling. 

To illustrate the complexity of modern computer mem- 



3 




Depolarizing probability (p) 



FIG. 4: Logical Z error rate per round of error correction for surface code distances d and depolarizing noise probabilities p. 
Dashed lines indicate expected low p asymptotic curves for d = 3, 5, 7 and 9. 




Depolarizing probability (p) 



FIG. 5: Logical X error rate per round of error correction for surface code distances d and depolarizing noise probabilities p. 
Dashed lines indicate expected low p asymptotic curves for d = 3, 5, 7 and 9. 



ory systems, we have generated increasingly large arrays 
of random integers and calculated the time required to 
swap a constant large number (10^^) of randomly cho- 
sen pairs of integers. The results are shown in Fig. [7] 
Ideally, a swap operation should be 0(1) independent of 
the array size. In practice, it can be seen that larger 
data sets lead to lower performance as CPU cache is ex- 
ceeded. The data in this manuscript was generated by 16 
core Intel Xeon 3.33GHz CPUs with 12MB of cache. Our 
matching code is more complex than this simple demon- 
stration, with gradual delocalization of data as the data 



set increases in size. This leads to a gradual reduction of 
the probability of a single memory page load containing 
additional useful data. 



To summarize, after accounting for low distance nearby 
boundaries which limit the complexity of matching and 
high distance slower memory access. Figs. J6] provide 
strong evidence supporting the claimed 0((i^ runtime 
of our implementation of the algorithm described in [T]. 
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FIG. 6: Amount time in seconds devoted to each round of matching when simulating a distance d single logical qubit square 
surface code for depolarizing error rates p. Quadratic curves have been included for reference. 
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FIG. 7: Average time in seconds required to perform 10 swaps of randomly chosen pairs of integers in arrays of increasing 
size. 
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The raw data used to generate Figs. |4]^ including 
timing information, is listed below. The first number is 
the number of different distances (14). The second is 
the distance d of the following block of data, the third is 
the number of different values n of the depolarizing error 
rate p. The next n lines list the value of p, t_check, the 



number of checks for Zj^ state changes, the number of 
checks for state changes, the observed number of Zj^ 
state changes, the observed number of state changes 
and finally the total number of CPU seconds devoted to 
the simulation of that (d, p) pair. This basic structure is 
repeated for each distance. The last number in the file is 
the total number of CPU seconds devoted to creation of 
the entire file. 

14 

3 

15 

0.0001 25780 93997 93997 11754 10000 37685.82 
0.001 261 94887 94886 11810 10000 415.42 
0.002 70 93413 93413 12015 10000 228.69 
0.005 12 91697 91696 11540 10000 33.62 
0.0065 7 97198 97197 12015 10000 30.32 
0.007 5 113854 113853 11598 10000 25.31 
0.0075 4 121578 121577 11670 10000 24.84 
0.008 3 142693 142692 11681 10000 26.08 
0.0085 2 184890 184889 11830 10000 29.62 
0.009 2 164349 164348 11696 10000 26.93 
0.0095 3 107513 107513 11496 10000 28.59 
0.01 3 98958 98957 11523 10000 21.47 
0.012 2 103971 103970 11759 10000 22.25 
0.015 1 125634 125633 11536 10000 27.58 
0.05 1 31051 31050 11478 10000 28.47 
5 

14 

0.001 3861 106980 106979 13628 10000 47490.43 
0.002 314 151251 151251 13637 10000 6568.45 
0.005 26 111948 111947 12953 10000 653.26 
0.0065 9 142249 142248 12523 10000 416.50 
0.007 6 165703 165702 12626 10000 359.61 
0.0075 6 137318 137317 12553 10000 308.21 
0.008 6 114309 114308 12021 10000 325.45 
0.0085 4 137468 137467 12061 10000 329.63 
0.009 3 151197 151196 12121 10000 336.85 
0.0095 3 131364 131364 11914 10000 304.38 
0.01 3 116216 116215 11896 10000 337.23 
0.012 2 103659 103658 11950 10000 332.92 
0.015 1 101838 101837 11265 10000 269.22 
0.05 1 23807 23806 10975 10000 544.14 
7 
14 

0.001 65493 124652 124652 16178 10000 3521570.27 
0.002 3731 114602 114602 15837 10001 165464.09 
0.005 88 89778 89777 13864 10000 6818.24 
0.0065 13 178531 178530 13492 10000 3272.27 
0.007 18 102587 102586 13184 10000 1624.85 
0.0075 10 131935 131934 13115 10000 1651.60 
0.008 6 157820 157819 12635 10000 1387.70 
0.0085 5 148477 148476 12683 10000 1220.78 
0.009 4 146237 146236 12497 10000 1470.92 
0.0095 4 120724 120724 12117 10000 2172.96 
0.01 3 126889 126888 12183 10000 1086.01 
0.012 1 157182 157181 11622 10000 1510.58 
0.015 1 82346 82345 11007 10000 1145.62 
0.05 1 21522 21521 10388 10000 1717.54 
9 
13 

0.002 31862 116333 116333 17719 10001 4289126.50 
0.005 134 134774 134773 14912 10000 34427.08 
0.0065 31 135372 135371 14001 10000 9139.10 
0.007 17 159583 159582 13532 10000 6835.25 

0.0075 13 146483 146482 13431 10000 5371.73 
0.008 6 206359 206358 12936 10000 4678.65 
0.0085 6 157382 157381 12947 10000 3371.82 
0.009 6 122371 122370 12735 10000 2893.09 



0.0095 3 163207 163207 12108 10000 2943.78 
0.01 2 177919 177918 12051 10000 5089.92 

0.012 1 140234 140233 11265 10000 4300.04 
0.015 1 68981 68980 11432 10000 3114.11 
0.05 1 20548 20547 10181 10000 5762.83 
11 

13 

0.002 10000 2970715 2970715 22288 10014 60628087.09 

0.005 442 102391 102390 16041 10000 116498.16 

0.0065 39 177762 177761 14738 10000 27844.55 

0.007 34 126474 126473 13882 10000 26712.36 

0.0075 18 149239 149238 13749 10000 15172.28 

0.008 10 170139 170138 13115 10000 9234.01 

0.0085 8 144574 144573 12947 10000 10480.43 

0.009 3 237671 237670 12947 10000 9925.01 

0.0095 3 171695 171695 12202 10000 9361.08 

0.01 4 107858 107857 11928 10000 6497.89 

0.012 1 126752 126751 11515 10000 7390.47 

0.015 1 58162 58161 11174 10000 4262.45 

0.05 1 19984 19983 9936 10000 12520.50 

13 

12 

0.005 937 114572 114572 16980 10000 603790.28 
0.0065 58 192423 192422 15266 10000 78381.30 
0.007 32 193240 193239 14502 10000 50053.45 
0.0075 26 143566 143565 14055 10000 34005.52 
0.008 12 180949 180948 13613 10000 22861.61 
0.0085 10 139111 139110 13025 10000 21325.58 
0.009 5 172414 172413 12920 10000 20014.77 
0.0095 5 121057 121057 12400 10000 13410.58 
0.01 3 130958 130957 12065 10000 11380.03 
0.012 1 115078 115077 11931 10000 10931.53 
0.015 1 50282 50281 11036 10000 6670.03 
0.05 1 19964 19963 10009 10000 28043.39 
15 
12 

0.005 2456 102892 102890 17492 10001 2173892.01 
0.0065 151 124143 124143 15471 10000 222593.70 
0.007 52 176282 176282 15132 10000 115430.02 
0.0075 28 174215 174215 14415 10000 78240.36 
0.008 17 161912 161912 14015 10000 52890.79 
0.0085 11 148576 148576 13102 10000 54511.94 
0.009 5 181422 181422 12771 10000 34858.01 
0.0095 4 146241 146241 12416 10000 30968.27 
0.01 3 129322 129322 12253 10000 28182.51 
0.012 1 101864 101864 11498 10000 17875.60 
0.015 1 44247 44247 10863 10000 11634.90 
0.05 1 19864 19864 9903 10000 98384.02 
17 
12 

0.005 5885 101613 101613 18732 10000 6704496.11 
0.0065 275 111939 111939 16021 10000 516700.36 
0.007 89 149029 149029 15277 10000 256345.82 
0.0075 54 123138 123138 14291 10000 126590.64 
0.008 23 148027 148027 13961 10000 82716.06 
0.0085 15 127265 127265 13161 10000 56207.63 
0.009 5 191569 191569 12961 10000 45131.54 
0.0095 4 148226 148226 12348 10000 34119.88 
0.01 2 169382 169382 12010 10000 46805.81 
0.012 1 89938 89938 11408 10000 24677.61 
0.015 1 39250 39250 10892 10000 17671.96 
0.05 1 20013 20013 9818 10000 132420.21 
19 
10 

0.0065 420 115409 115409 16474 10000 963439.12 
0.007 163 121377 121377 15659 10000 533590.20 
0.0075 69 125702 125702 14879 10000 269356.52 
0.008 46 94467 94467 13698 10000 144521.98 
0.0085 20 114779 114779 13936 10000 82840.47 
0.009 8 142942 142942 12913 10000 60109.55 
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0.0095 4 146009 146009 12300 10000 55487.99 
0.01 3 123422 123422 12369 10000 47540.48 

0.012 1 81072 81072 11411 10000 35343.85 
0.015 1 36705 36705 11159 10000 28088.66 
21 
10 

0.0065 859 91469 91463 16689 10000 2147913.54 
0.007 226 124800 124798 16152 10000 1191853.64 
0.0075 107 108172 108172 15048 10000 472320.02 
0.008 44 118209 118209 14480 10000 276634.69 
0.0085 19 128841 128841 13654 10000 125697.36 
0.009 8 144541 144541 12716 10000 93677.72 
0.0095 3 182294 182294 12630 10000 170367.06 
0.01 3 115647 115647 11821 10000 63634.76 
0.012 1 75689 75689 11377 10000 52612.18 
0.015 1 33355 33355 11109 10000 40737.04 
25 
5 

0.008 55 128564 128564 14352 10000 507860.33 

0.0085 17 165192 165192 13888 10000 231570.51 

0.009 9 143332 143332 13211 10000 175110.91 

0.0095 3 173140 173140 12352 10001 163829.39 

0.01 199011. 04 

35 

5 

0.008 88 167587 167587 15592 10000 2801803.43 

0.0085 14 269115 269115 14131 10000 1292528.08 

0.009 7 176942 176942 12822 10000 550365.22 

0.0095 4 125106 125106 12314 10001 437366.43 

0.01 2 34 34 3 1 61.08 

45 

5 

0.008 230 128455 128422 15969 10000 11334092.96 

0.0085 27 203196 203196 14039 10000 2845252.53 

0.009 9 147701 147701 12962 10000 1587903.33 

0.0095 3 125196 125196 11684 10001 944033.23 

0.01 1 13 13 1 16.30 

55 

5 

0.008 197 285003 284944 18433 10489 37296797.55 

0.0085 34 210799 210785 14340 10000 7669758.24 
0.009 7 170072 170072 12541 10000 3025099.15 
0.0095 2 140843 140840 11836 10000 2813159.55 
0.01 1 66 66 9 1 2038.71 
161469013.53 

For convenience, we also include below the processed 
raw data with checks and changes converted into proba- 
bilities of logical error. Second column is the probability 
of Zl error per round of error correction. Third column 
is the same data for Xj,. 
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